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DETECTION OF CFP2ZM POLYMORPHISMS 

The present invention is directed to detection of certain polymorphisms in 
the 5' regulatory region of the gene encoding cytochrome P450 2D6, also known as 
5 CYP2D6, bufuralol-1 -hydroxylase, or debrisoquine/sparteine hydroxylase, to 
predict variations in an individual's ability to metabolize certain drugs. 

BACKGROUND OF THE INVENTION 
Xenobiotics are pharmacologically, endocrinologically, or toxicologically 
active substances foreign to a biological system. Most xenobiotics, including 

10 pharmaceutical agents, are metabolized through two successive reactions. Phase I 
reactions (functionalization reactions), include oxidation, reduction, and hydrolysis, 
in which a derivatizable group is added to the original molecule. Functionalization 
prepares the drug for further metabolism in phase II reactions. During phase II 
reactions (conjugative reactions, which include glucoronidation, sulfation, 

15 methylation and acetylation), the functionalized drug is conjugated with a 

hydrophilic group. The resulting hydrophilic compounds are inactive and excreted 
in bile or urine. Thus, metabolism can result in detoxification and excretion of the 
active substance. Alternatively, an inert xenobiotic may be metabolized to an active 
compound. For example, a pro-drug may be converted to a biologically active 

20 therapeutic or toxin. 

The cytochrome P450 (CYP) enzymes are involved in the metabolism of 
many different xenobiotics. CYPs are a superfamily of heme-containing enzymes, 
found in eukaryotes (both plants and animals) and prokaryotes, and are responsible 
for Phase I reactions in the metabolic process. In total, over 500 genes belonging to 

25 the CYP superfamily have been described and divided into subfamilies, CYP1- 
CYP27. In humans, more than 35 genes and 7 pseudogenes have been identified. 
Members of three CYP gene families, CYP1, CYP2, and CYP3, are responsible for 
the majority of drug metabolism. The human CYPs which are of greatest clinical 
relevance for the metabolism of drugs and other xenobiotics are CYP1 A2, CYP2A6, 

30 CYP2C9, CYP2C19, CYP2D6, CYP2E1 and CYP3A4. The liver is the major site 
of activity of these enzymes, however CYPs are also expressed in other tissues. 

Approximately 20% of known drugs are substrates for CYP2D6, and thus the 
metabolism of these drugs is wholly or partially mediated by this enzyme. For 
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example, a variety of antidepressants including the tricyclic antidpressants and the 
SSRIs (serotonin reuptake inhibitors) are substrates of CYP2D6. Antipsychotics 
such as haloperidol, perphenazine, thioridazine, and zuclopenthixol are also 
substrates of CYP2D6. In addition, CYP2D6 hydroxylates P-adrenoreceptor 
5 blocking agents such as propranolol, metoprolol, and timolol and anti-arrhythmic 
drugs such as sparteine, diprafenone, and propafenone. Codeine is hydroxylated to 
morphine by CYP2D6. 

CYP2D6 is a polymorphic enzyme, that is, more than one form of the 
enzyme is present within the human population. The different forms of the CYP2D6 
10 enzyme have differing abilities to hydroxylate substrates, which impacts on the rate 
p at which the substrates are removed from the body. The form of CYP2D6 that an 

^ individual inherits will determine how quickly a substrate is removed from the 

M individual's body. Because CYP2D6 is polymorphic, individuals differ in their 

I J| ability to metabolize the drugs that are substrates of CYP2D6, and consequently, 

H 15 wide variations in responses to such drugs, including susceptibility to side effects, 

;** have been observed. 

Wl On the basis of ability of metabolize a marker drug such as debrisoquine or 

n\ sparteine, individuals may be characterized as poor metabolizers (PM), intermediate 

*f metabolizers (IM), extensive metabolizers (EM) or ultra extensive metabolizers 

o 

jl 20 (UEM or UM) for CYP2D6 substrates. Poor metabolizers retain the CYP2D6 

substrate in their bodies for a relatively long period of time, and are susceptible to 
toxicity and side effects at "normal" dosages, Ultraextensive metabolizers clear the 
CYP2D6 substrate from their bodies quickly, and require higher than "normal" 
dosages to achieve a therapeutic effect. Intermediate and extensive metabolizers 

25 retain the CYP2D6 substrate in their bodies for times between those of PMs and 
UEMs, and are more likely to respond to "normal" dosages of the drug. However, 
individuals characterized as IM or EM may differ in drug clearance by as much as 
80-fold, and variations in toxicity, side effects, and efficacy for a particular drug 
may occur among these individuals. 

30 The existence of more than one form of the CYP2D6 enzyme is caused by 

polymorphisms in the gene which encodes the CYP2D6 enzyme (the gene being 
denoted in italics, as CYP2D6, SEQ ID NO:l). In fact, more than 30 
polymorphisms in the CYP2D6 gene have been described (see 



1 



id 



http://www.imm.ki.se/cypalleles/ for listing). The frequency of a particular CYP2D6 
polymorphism may differ widely among ethnic groups, with concomitant 
differences in CYP2D6 activity and responses to drugs which are CYP2D6 
substrates. The frequencies of CYP2D6 mutations in European populations are 
5 presented in Marez, et al (1997) Pharmacogenetics 7, 193-202 and Sachse, et al 
(1997) Am. J. Hum. Genet 60, 284-295. The most common polymorphisms are 
CYP2D6*1A, CYP2D6*2, CYP2D6*2B, CYP2D6HA, and CYP2D6*5, which 
account for about 87% of all CYP2D6 alleles in Europeans. CYP2D6HA encodes 
an active enzyme and is commonly known as the wild type gene. CYP2D6*2 and 

10 CYP2D6*2B encode a functional enzyme which has slightly decreased activity. 

CYP2D6HA includes a G to A substitution at position 3465 of SEQ ID NO:l, which 
results in a splicing defect and a truncated, inactive protein, and CYP2D6*5 is a 
deletion of the entire CYP2D6 gene, resulting in no CYP2D6 enzyme activity. 

Polymorphisms which result in a defective or absent CYP2D6 enzyme are 

15 generally correlated with the PM phenotype. A number of efforts have been made to 
detect these polymorphisms in order to predict an individual's response to CYP2D6 
substrates without administering a potentially toxic drug. For example, WO 
91/10745 discloses a method of identifying mutations at one or more of positions 
100, 271, 281, 294, 408, 506, 775, or 1432 of CYP2D6, to distinguish PMs from 

20 EMs. The numbering of the CYP2D6 sequence employed in WO 91/10745 began at 
the initiation codon and thus did not include the 5' flanking region of the gene. 

U.S.Pat.No. 5,648,482 and corresponding EP 463 395 Bl disclose 
polymerase chain reaction (PCR) primers for specifically amplifying alleles of the 
CYP2D6 gene, for detection of PMs. The PCR primers of U.S.Pat.No. 5,648,482 

25 and EP 463 395 B 1 are complementary to intronic sequences unique to CYP2D6. 

EP 759 476 Al discloses PCR primers and methods for detecting a nine base 
pair insertion in exon 9 of CYP2D6, useful for detecting PMs. 

The UEM phenotype is generally correlated with amplifications of functional 
CYP2D6 genes. Bertilsson et al (1996) CNS Drugs 5, 200-223 discloses that such 

30 amplifications include duplications and triplications, though up to 13 copies of the 
CYP2D6 gene have been found in some families. Lundqvist, et al (1999) Gene 226, 
327-338, discloses that 7% of Spaniards, 29% of Ethiopians, and 20% of Saudi 
Arabians have duplicated or multiduplicated CYP2D6 genes. Bertilsson et al also 



discloses that a duplicated or amplified CYP2D6L2 (CYP2D6*2XN) allele is present 
in about 1-2% of the Swedish Caucasian population, and that this allele is present 
only in about 40% of individuals with a metabolic ratio of less than 0.1. Thus 
CYP2D6 gene amplification may not explain the genetics of all CYP2D6 UEM, and 
5 additional methods of detecting such individuals are needed. 

Lundqvist, et al discloses several mutations in the 5' flanking region of the 
CYP2D6 gene, including a C to G substitution at -1496, a 5 A insertion between - 
1 149 and -1 148, an A to G substitution at -1 147, a C to T substitution at -653, and a 
G to A substitution at -591. Raimundo, et al (1999) Eur. J. Clin. Pharmacol 55, 

10 A5 discloses seven point mutations in the 5' flanking region of the CYP2D6 gene in 
an abstract describing a study to characterize inter-individual metabolic capacity in 
EMs. The mutations disclosed were: -234 (C to T), -590 (A to G), -652 (T to C), - 
912 (A to G), -1 147 (G to A), -1338 (T to C), and -1496 (G to C). In a later 
publication, Raimundo, et al (2000) Pharmacogenetics 10, 577-581, the mutations 

15 at -1496, -652, and -590 were disclosed to be exclusively associated with the 

functional CYP2D6*2 allele, and the mutations at -1338 and -912 were disclosed to 
be associated with the nonfunctional CYP2D6H allele and the functional 
CYP2D6H0 allele. The mutation at - 1 147 was found in all alleles investigated. On 
the basis of an association between the mutation at -1496 and the EM phenotype, in 

20 particular for individuals having the CYP2D6*2/CYP2D6*0 genotype, Raimundo et 
al proposed that that at least 50-60 % of all IMs could be predicted. 

U.S.Pat.No. 6,045,996 discloses oligonucleotide arrays including the 
complete coding sequence of the CYP2D6 gene, exon by exon, including probes to 
detect specifically known polymorphisms. 

25 Because of the complexity of the CYP2D6 genetic locus and the impact of 

the CYP2D6 enzyme on drug metabolism, additional diagnostic or prognostic 
methods and tools are needed. Such methods and tools will be useful in predicting 
an individual's likely response to a drug and in selecting subjects for clinical trials. 
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SUMMARY OF THE INVENTION 

The present inventors have discovered that individuals who are homozygous 
or heterozygous for certain haplotypes consisting of polymorphic sites in the 5' 
flanking region of the CYP2D6 gene exhibit characteristic metabolic ratios for 
debrisoquine. Using this information, the capacity of individuals to metabolize 
drugs which are substrates of the CYP2D6 enzyme may be predicted by genotyping 
those polymorphisms. 

In one embodiment, the invention provides a method for determining a 
human's capacity to metabolize a substrate of a CYP2D6 enzyme, said method 
comprising the steps of: isolating single stranded nucleic acids from the human, said 
nucleic acids encoding 5' flanking regions of CYP2D6 genes present on each 
homologous chromosome 22 of the human, wherein said region is represented by a 
sequence as set forth in SEQ ID NO;2; and detecting at least three polymorphisms 
within the region, wherein the polymorphisms are selected from the group consisting 
of nucleotides present at polymorphic sites represented by positions 36, 194, and 
942 of SEQ ID NO:2; nucleotides at polymorphic sites represented by positions 36, 
620, and 942 of SEQ ID NO:2; nucleotides at polymorphic sites represented by 
positions 36, 194, and 880 of SEQ ID NO:2; nucleotides at polymorphic sites 
represented by positions 36, 620, and 880 of SEQ ID NO:2; nucleotides at 
polymorphic sites represented by positions 36, 194, 620, and 880 of SEQ ID NO:2; 
nucleotides at polymorphic sites represented by positions 36, 194, 620, and 942 of 
SEQ ID NO:2; nucleotides at polymorphic sites represented by positions 36, 620, 
880, and 942 of SEQ ID NO:2; and nucleotides at polymorphic sites represented by 
positions 36, 194, 620, 880, and 942 of SEQ ID NO:2. 

In another embodiment, the invention provides a sequence determination 
oligonucleotide suitable for detecting polymorphic sites in a 5' flanking region of a 
CYP2D6 gene, said oligonucleotide comprising a sequence selected from the group 
consisting of a sequence complementary to the polymorphic region corresponding to 
position 36 of SEQ ID NO:2; a sequence complementary to the polymorphic region 
corresponding to position 194 of SEQ ID NO:2; a sequence complementary to the 
polymorphic region corresponding to position 620 of SEQ ID NO:2; a sequence 
complementary to the polymorphic region corresponding to position 880 of SEQ ID 



N0:2; and a sequence complementary to the polymorphic region corresponding to 
position 942 of SEQ ID NO:2. 

In another embodiment, the invention provides an oligonucleotide primer 
pair suitable for amplifying a 5' flanking region of a CYP2D6 gene, said primer pair 
5 comprising sequences selected from the group consisting of: SEQ ID NO: 17 and 
SEQ ID NO: 18; SEQ ID NO: 19 and SEQ ID NO:20; SEQ ID NO:21 and SEQ ID 
NO-.22; SEQ ID NO:23 and SEQ ID NO:24; SEQ ID NO:25 and SEQ ID NO:26; 
SEQ ID NO:27 and SEQ ID NO:28; SEQ ID NO:29 and SEQ ID NO:30; SEQ ID 
NO:31 and SEQ ID NO:32; SEQ ID NO:33 and SEQ ID NO:34; and SEQ ID 

10 NO:35 and SEQ ID NO: 18. 

In another embodiment, the invention provides a kit comprising at least three 
pairs of oligonucleotide primers suitable for amplifying a 5' flanking region of a 
CYP2D6 gene, said primer pairs being selected from the group consisting of SEQ ID 
NO: 17 and SEQ ID NO: 18; SEQ ID NO: 19 and SEQ ID NO:20; SEQ ID NO:21 

15 and SEQ ID NO:22; SEQ ID NO:23 and SEQ ID NO:24; SEQ ID NO:25 and SEQ 
ID NO:26; SEQ ID NO:27 and SEQ ID NO:28; SEQ ID NO:29 and SEQ ID NO:30; 
SEQ ID NO:31 and SEQ ID NO:32; SEQ ID NO:33 and SEQ ID NO:34; and SEQ 
ID NO:35 and SEQ ID NO: 18; and at least three sequence determination 
oligonucleotides, said oligonucleotides comprising sequences selected from the 

20 group consisting of: SEQ ID NO:3; SEQ ID NO: 10; SEQ ID NO:36; SEQ ID 

NO:37; SEQ ID NO:50; SEQ ID NO:57; SEQ ID NO:64; SEQ ID NO:71 (for PCR- 
products amplified with SEQ ID NO:35 and SEQ ID NO: 18) for polymorphic 
position 36; SEQ ID NO:4; SEQ ID NO:ll; SEQ ID NO:38; SEQ ID NO:39; SEQ 
ID NO:51; SEQ ID NO:58; SEQ ID NO:65; SEQ ID NO:72 (for PCR-products 

25 amplified with SEQ ID NO: 17 and SEQ ID NO: 18, or SEQ ID NO:31 and SEQ ID 
NO:32) for polymorphic position 194; SEQ ID NO:5; SEQ ID NO: 12; SEQ ID 
NO:40; SEQ ID NO:41; SEQ ID NO:52; SEQ ID NO:59; SEQ ID NO:66; SEQ ID 
NO:73 (for PCR-products amplified with SEQ ID NO:29 and SEQ ID NO:30, or 
SEQ ID NO:33 and SEQ ID NO:34) for polymorphic position 385; SEQ ID NO:6; 

30 SEQ ID NO:13; SEQ ID NO:42; SEQ ID NO:43; SEQ ID NO:53; SEQ ID NO:60; 
SEQ ID NO:67; SEQ ID NO:74 (for PCR-products amplified with SEQ ID NO:27 
and SEQ ID NO:28, or SEQ ID NO:29 and SEQ ID NO:30) for polymorphic 
position 620; SEQ ID NO:7; SEQ ID NO: 14; SEQ ID NO:44; SEQ ID NO:45; SEQ 
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ID NO:54; SEQ ID N0:61; SEQ ID NO:68; SEQ ID NO:75 (for PCR-products 
amplified with SEQ ID NO: 19 and SEQ ID NO:20, or SEQ ID NO:25 and SEQ ID 
NO:26) for polymorphic position 880; SEQ ID NO:8; SEQ ID NO: 15; SEQ ID 
NO:46; SEQ ID NO:47; SEQ ID NO:55; SEQ ID NO:62; SEQ ID NO:69; SEQ ID 
5 NO:76 (for PCR-products amplified with SEQ ID NO: 19 and SEQ ID NO:20, or 
SEQ ID NO:25 and SEQ ID NO:26) for polymorphic position 942; and SEQ ID 
NO:9; SEQ ID NO: 16; SEQ ID NO:48; SEQ ID NO:49; SEQ ID NO:56; SEQ ID 
NO:63; SEQ ID NO:70; SEQ ID NO:77 (for PCR-products amplified with SEQ ID 
NO:21 and SEQ ID NO:22, or SEQ ID NO:23 and SEQ ID NO:24) for polymorphic 
10 position 1255. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows the sequence of the CYP2D6 gene as set forth in SEQ ID 

15 NO:l. 

Figure 2 shows the 5' flanking region of the CYP2D6 gene as set forth in 
SEQ ID NO:2 with polymorphic sites highlighted in bold. 

Figure 3 outlines the One Base Sequencing (OBS) method of SNP detection. 

20 
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DETAILED DESCRIPTION OF THE INVENTION 

The U.S. patents and publications referenced herein are hereby incorporated 
by reference. 

5 For the purposes of the invention, certain terms are defined as follows. 

"Gene" is defined as the genomic sequence of the CYP2D6 gene. 
"Oligonucleotide" means a nucleic acid molecule preferably comprising 
from about 8 to about 50 covalently linked nucleotides. More preferably, an 
oligonucleotide of the invention comprises from about 8 to about 35 nucleotides. 

10 Most preferably, an oligonucleotide of the invention comprises from about 10 to 
about 25 nucleotides. In accordance with the invention, the nucleotides within an 
oligonucleotide may be analogs or derivatives of naturally occurring nucleotides, so 
long as oligonucleotides containing such analogs or derivatives retain the ability to 
hybridize specifically within the polymorphic region containing the targeted 

15 polymorphism. Analogs and derivatives of naturally occurring oligonucleotides 
within the scope of the present invention are exemplified in U.S. Pat. Nos, 
4,469,863; 5,536,821; 5,541,306; 5,637,683; 5,637,684; 5,700,922; 5,717,083; 
5,719,262; 5,739,308; 5,773,601; 5,886,165; 5,929,226; 5,977,296; 6,140,482; WO 
00/56746; WO 01/14398, and the like. Methods for synthesizing oligonucleotides 

20 comprising such analogs or derivatives are disclosed, for example, in the patent 
publications cited above and in U.S. Pat. Nos. 5,614,622; 5,739,314; 5,955,599; 
5,962,674; 6,117,992; in WO 00/75372, and the like. The term "oligonucleotides" 
as defined herein includes compounds which comprise the specific oligonucleotides 
disclosed herein, covalently linked to a second moiety. The second moiety may be 

25 an additional nucleotide sequence, for example, a tail sequence such as a 

polyadenosine tail or an adaptor sequence, for example, the phage M13 universal tail 
sequence, and the like. Alternatively, the second moiety may be a non-nucleotidic 
moiety, for example, a moiety which facilitates linkage to a solid support or a label 
to facilitate detection of the oligonucleotide. Such labels include, without limitation, 

30 a radioactive label, a fluorescent label, a chemiluminescent label, a paramagnetic 
label, and the like. The second moiety may be attached to any position of the 
specific oligonucleotide, so long as the oligonucleotide retains its ability to hybridize 
to the polymorphic regions described herein. 
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A polymorphic region as defined herein is a portion of a genetic locus that is 
characterized by at least one polymorphic site. A genetic locus is a location on a 
chromosome which is associated with a gene, a physical feature, or a phenotypic 
trait. A polymorphic site is a position within a genetic locus at which at least two 
5 alternative sequences have been observed in a population. A polymorphic region as 
defined herein is said to "correspond to" a polymorphic site, that is, the region may 
be adjacent to the polymorphic site on the 5' side of the site or on the 3' side of the 
site, or alternatively may contain the polymorphic site. A polymorphic region 
includes both the sense and antisense strands of the nucleic acid comprising the 

10 polymorphic site, and may have a length of from about 100 to about 5000 base pairs. 
For example, a polymorphic region may be all or a portion of a regulatory region 
such as a promoter, 5' UTR, 3' UTR, an intron, an exon, or the like. A polymorphic 
or allelic variant is a genomic DNA, cDNA, mRNA or polypeptide having a 
nucleotide or amino acid sequence that comprises a polymorphism. A 

15 polymorphism is a sequence variation observed at a polymorphic site, including 
nucleotide substitutions (single nucleotide polymorphisms or SNPs), insertions, 
deletions, and microsatellites. Polymorphisms may or may not result in detectable 
differences in gene expression, protein structure, or protein function. Preferably, a 
polymorphic region of the present invention has a length of about 1000 base pairs. 

20 More preferably, a polymorphic region of the invention has a length of about 500 
base pairs. Most preferably, a polymorphic region of the invention has a length of 
about 200 base pairs. 

A haplotype as defined herein is a representation of the combination of 
polymorphic variants in a defined region within a genetic locus on one of the 

25 chromosomes in a chromosome pair. A genotype as used herein is a representation 
of the polymorphic variants present at a polymorphic site. 

Methods of predicting an individual human's capacity to metabolize drugs 
which are substrates for the CYP2D6 enzyme are encompassed by the present 
invention. In the methods of the invention, the presence or absence of at least three 

30 polymorphic variants of the nucleic acid of SEQ ID NO:2 are detected to determine 
the individual's haplotype for those variants. Specifically, in a first step, a nucleic 
acid is isolated from biological sample obtained from the human. Any nucleic-acid 
containing biological sample from the human is an appropriate source of nucleic 
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acid for use in the methods of the invention. For example, nucleic acid can be 
isolated from blood, saliva, sputum, urine, cell scrapings, biopsy tissue, and the like. 
In a second step, the nucleic acid is assayed for the presence or absence of at least 
three allelic variants of the polymorphic regions of the nucleic acid of SEQ ID NO:2 
5 described above. Specifically, a haplotype is constructed for at least three 

polymorphic sites in the 5' regulatory region of the CYP2D6 gene in the method of 
the invention. The polymorphic sites may be selected from the group consisting of 
positions 36, 194, and 942 of SEQ ID NO:2; positions 36, 620, and 942 of SEQ ID 
NO:2; positions 36, 194, and 880 of SEQ ID NO:2; positions 36, 620, and 880 of 
10 SEQ ID NO:2; positions 36, 194, 620, and 942 of SEQ ID NO:2; positions 36, 620, 

y 880, and 942 of SEQ ID NO:2, or positions 36, 194, 620, 880, and 942 of SEQ ID 

#: NO:2. Preferably, at least three polymorphic sites on each chromosome in the 

chromosome pair of the human are assayed in the method of the invention, so that 

W the zygosity of the individual for the particular polymorphic variant may be 

n 15 determined. 

^ Any method may be used to assay the nucleic acid, that is, to determine the 

m sequence of the polymorphic region, in this step of the invention. For example, any 

^ of the primer extension-based methods, ligase-based sequence determination 

p methods, mismatch-based sequence determination methods, sequencing methods, or 

p " 20 microarray-based sequence determination methods described above may be used, in 
accordance with the present invention. Alternatively, such methods as restriction 
fragment length polymorphism (RFLP) detection, single strand conformation 
polymorphism detection (SSCP), PCR-based assays such as the Taqman® PCR 
System (Applied Biosystems) may be used. 
25 The oligonucleotides of the invention may be used to determine the sequence 

of the polymorphic regions of SEQ ID NO:2. In particular, oligonucleotides within 
the scope of the present invention may comprise any of the sequences as set forth in 
SEQ ID NO:3, SEQ ED NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ 
ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:ll, SEQ ID NO:12, SEQ ID 
30 NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:36, SEQ ID 
NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID 
NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID 
NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:51, SEQ ID 



NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55, SEQ ID NO:56, SEQ ID 
NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ID N0:61, SEQ ID 
NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID NO:66, SEQ ID 
NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70, SEQ ID N0:71, SEQ ID 
5 NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:76 and SEQ 
IDNO:77. 

Those of ordinary skill will recognize that oligonucleotides complementary 
to the polymorphic regions described herein must be capable of hybridizing to the 
polymorphic regions under conditions of stringency such as those employed in 

10 primer extension-based sequence determination methods, restriction site analysis, 
nucleic acid amplification methods, ligase-based sequencing methods, methods 
based on enzymatic detection of mismatches, microarray-based sequence 
determination methods, and the like. The oligonucleotides of the invention may be 
synthesized using known methods and machines, such as the ABI™3900 High 

15 Throughput DNA Synthesizer and the Expedite™ 8909 Nucleic Acid Synthesizer, 
both of which are available from Applied Biosy stems (Foster City,CA). 

The oligonucleotides of the invention may be used, without limitation, as in 
situ hybridization probes or as components of diagnostic assays. Numerous 
oligonucleotide-based diagnostic assays are known. For example, primer extension- 

20 based nucleic acid sequence detection methods are disclosed in U.S.Pat.Nos. 
4,656,127; 4,851,331; 5,679,524; 5,834,189; 5,876,934; 5,908,755; 5,912,118; 
5,976,802; 5,981,186; 6,004,744; 6,013,431; 6,017,702; 6,046,005; 6,087,095; 
6,210,891; WO 01/20039; and the like. Primer extension-based nucleic acid 
sequence detection methods using mass spectrometry are described in U.S.Pat.Nos. 

25 5,547,835; 5,605,798; 5,691,141; 5,849,542; 5,869,242; 5,928,906; 6,043,031; 

6,194,144, and the like. The oligonucleotides of the invention are also suitable for 
use in ligase-based sequence determination methods such as those disclosed in 
U.S.Pat.Nos. 5,679,524 and 5,952,174, WO 01/27326, and the like. The 
oligonucleotides of the invention may be used as probes in sequence determination 

30 methods based on mismatches, such as the methods described in U.S.Pat.Nos. 
5,851,770; 5,958,692; 6,110,684; 6,183,958; and the like. In addition, the 
oligonucleotides of the invention may be used in hybridization-based diagnostic 
assays such as those described in U.S.PatNos. 5,891,625; 6,013,499; and the like. 

11 



The oligonucleotides of the invention may also be used as components of a 
diagnostic microarray. Methods of making and using oligonucleotide microarrays 
suitable for diagnostic use are disclosed in U.S.Pat.Nos, 5,492,806; 5,525,464; 
5,589,330; 5,695,940; 5,849,483; 6,018,041; 6,045,996; 6,136,541; 6,142,681; 
5 6,156,501; 6,197,506; 6,223,127; 6,225,625; 6,229,911; 6,239,273; WO 00/52625; 
WO 01/25485; WO 01/29259; and the like. 

Each of the PCR primer pairs of the invention may be used in any PCR 
method. For example, a PCR primer pair of the invention may be used in the 
methods disclosed in U.S.Pat.Nos. 4,683,195; 4,683,202, 4,965,188; 5,656,493; 
10 5,998,143; 6,140,054; WO 01/27327; WO 01/27329; and the like. The PCR pairs of 
the invention may also be used in any of the commercially available machines that 
3 perform PCR, such as any of the GeneAmp® Systems available from Applied 

£j Biosystems. 

J: The invention is also embodied in a kit comprising at least three 

|i| 15 oligonucleotide primer pairs of the invention. Preferably, the kit of the invention 

comprises at least five oligonucleotide primer pairs, wherein each primer pair is 
$ capable of amplifying a different polymorphic region of the nucleic acid of SEQ ID 

JJ NO:2, said polymorphic regions corresponding to positions 36, 194, 620, 880, and 

fli 942 of SEQ ID NO:2. More preferably, the kit of the invention comprises at least 

S 20 four oligonucleotide primer pairs suitable for amplification of polymorphic regions 

corresponding to positions 36, 194, 620, and 880 of SEQ ID NO:2; or at least four 
oligonucleotide primer pairs suitable for amplification of polymorphic regions 
corresponding to positions 36, 194, 620, and 942 of SEQ ID NO:2; or at least four 
oligonucleotide primer pairs suitable for amplification of polymorphic regions 
25 corresponding to positions 36, 620, 880, and 942 of SEQ ID NO:2. Most preferably, 
the kit of the invention comprises at least three oligonucleotide primer pairs suitable 
for amplification of polymorphic regions corresponding to positions 36, 194, and 
942 of SEQ ID NO:2; or at least three oligonucleotide primer pairs suitable for 
amplification of polymorphic regions corresponding to positions 36, 194, and 880 of 
30 SEQ ID NO:2; or at least three oligonucleotide primer pairs suitable for 

amplification of polymorphic regions corresponding to positions 36, 620, and 942 of 
SEQ ID NO:2; or at least three oligonucleotide primer pairs suitable for 
amplification of polymorphic regions corresponding to positions 36, 620, and 880 of 



SEQ ID N0:2. This embodiment may optionally further comprise a sequence 
determination oligonucleotide for detecting a polymorphic variant at any or all of the 
polymorphic sites corresponding to positions 36, 194, 620, 880, and 942 of SEQ ID 
NO:2. The kit of the invention may also comprise a polymerizing agent, for 
5 example, a thermostable nucleic acid polymerase such as those disclosed in 

U.S.Pat.Nos. 4,889,818; 6,077,664, and the like. The kit of the invention may also 
comprise chain elongating nucleotides, such as dATP, dTTP, dGTP, dCTP, and 
dITP, including analogs of dATP, dTTP, dGTP, dCTP and dITP, so long as such 
analogs are substrates for a thermostable nucleic acid polymerase and can be 
10 incorporated into a growing nucleic acid chain. The kit of the invention may also 
P i include chain terminating nucleotides such as ddATP, ddTTP, ddGTP, ddCTP, and 

€1 the like. In a preferred embodiment, the kit of the invention comprises at least three 

IS oligonucleotide primer pairs, a polymerizing agent, chain elongating nucleotides, at 

j ^ least three sequence determination oligonucleotides and at least one chain 

y.„ 15 terminating nucleotide. The kit of the invention may optionally include buffers, 

M vials, microtiter plates, and instructions for use. 

Q In one specific embodiment, the invention provides a kit comprising a pair of 

Jf oligonucleotide primers suitable for amplifying the polymorphic region 

y3 corresponding to position 36 of the CYP2D6 gene 5' flanking region as set forth in 

ri 

j£ 20 SEQ ID NO:2, a primer pair suitable for amplifying the polymorphic region 

corresponding to position 194 of the CYP2D6 gene 5' flanking region as set forth in 
SEQ ID NO:2; a primer pair suitable for amplifying the polymorphic region 
corresponding to position 942 of the CYP2D6 gene 5' flanking region as set forth in 
SEQ ID NO;2; a sequence determination oligonucleotide comprising a sequence 

25 selected from the group consisting of SEQ ID NO:3; SEQ ID NO: 10; SEQ ID 

NO:36; SEQ ID NO:37; SEQ ID NO:50; SEQ ID NO:57; SEQ ID NO;64; and SEQ 
ID NO:71; a sequence determination oligonucleotide comprising a sequence selected 
from the group consisting of SEQ ID NO:4; SEQ ID NO: 1 1 ; SEQ ID NO:38; SEQ 
ID NO:39; SEQ ID NO:51; SEQ ID NO:58; SEQ ID NO:65; and SEQ ID NO:72; 

30 and a sequence determination oligonucleotide comprising a sequence selected from 
the group consisting of SEQ ID NO:8; SEQ ID NO:15; SEQ ID NO:46; SEQ ID 
NO:47; SEQ ID NO:55; SEQ ID NO:62; SEQ ID NO:69; and SEQ ID NO:76. The 
primer pairs of this embodiment are preferably selected from the group consisting of 

13 



SEQ ID NO:35 and SEQ ID NO: 18 (for amplification of the polymorphic region 
corresponding to position 36 of SEQ ID NO:2); SEQ ID NO:17 and SEQ ID NO:18; 
SEQ ID NO:31 and SEQ ID NO:32 (for amplification of the polymorphic region 
corresponding to position 194 of SEQ ID NO:2); SEQ ID NO: 19 and SEQ ID 
5 NO:20; and SEQ ID NO:25 and SEQ ID NO:26 (for amplification of the 
polymorphic region corresponding to position 942 of SEQ ID NO:2.) 

Alternatively, the invention may be specifically embodied in a kit comprising 
a pair of oligonucleotide primers suitable for amplifying the polymorphic region 
corresponding to position 36 of the CYP2D6 gene 5' flanking region as set forth in 

10 SEQ ID NO:2, a primer pair suitable for amplifying the polymorphic region 

corresponding to position 194 of the CYP2D6 gene 5' flanking region as set forth in 
SEQ ID NO:2; a primer pair suitable for amplifying the polymorphic region 
corresponding to position 880 of the CYP2D6 gene 5' flanking region as set forth in 
SEQ ID NO:2; a sequence determination oligonucleotide comprising a sequence 

15 selected from the group consisting of SEQ ID NO:3; SEQ ID NO: 10; SEQ ID 

NO:36; SEQ ID NO:37; SEQ ID NO:50; SEQ ID NO:57; SEQ ID NO:64; and SEQ 
ID NO:71; a sequence determination oligonucleotide comprising a sequence selected 
from the group consisting of SEQ ID NO:4; SEQ ID NO: 1 1; SEQ ID NO:38; SEQ 
ID NO:39; SEQ ID NO:51; SEQ ID NO:58; SEQ ID NO:65; and SEQ ID NO:72; 

20 and a sequence determination oligonucleotide comprising a sequence selected from 
the group consisting of SEQ ID NO:7; SEQ ID NO: 14; SEQ ID NO:44; SEQ ID 
NO:45; SEQ ID NO:54; SEQ ID NO:61; SEQ ID NO:68; SEQ ID NO:75. The 
primer pairs of this embodiment are preferably selected from the group consisting of 
SEQ ID NO:35 and SEQ ID NO: 18 (for amplification of the polymorphic region 

25 corresponding to position 36 of SEQ ID NO:2); SEQ ID NO: 17 and SEQ ID NO: 18; 
SEQ ID NO:31 and SEQ ID NO:32 (for amplification of the polymorphic region 
corresponding to position 194 of SEQ ID NO:2); SEQ ID NO: 19 and SEQ ID 
NO:20; and SEQ ID NO:25 and SEQ ID NO:26 (for amplification of the 
polymorphic region corresponding to position 880 of SEQ ID NO:2.) 

30 In another specific embodiment, the kit of the invention comprises a pair of 

oligonucleotide primers suitable for amplifying the polymorphic region 
corresponding to position 36 of the CYP2D6 gene 5' flanking region as set forth in 
SEQ ID NO:2, a primer pair suitable for amplifying the polymorphic region 



corresponding to position 620 of the CYP2D6 gene 5' flanking region as set forth in 
SEQ ID NO: 2; a primer pair suitable for amplifying the polymorphic region 
corresponding to position 942 of the CYP2D6 gene 5' flanking region as set forth in 
SEQ ID NO:2; a sequence determination oligonucleotide comprising a sequence 
selected from the group consisting of SEQ ID NO:3; SEQ ID NO: 10; SEQ ID 
NO:36; SEQ ID NO:37; SEQ ID NO:50; SEQ ID NO:57; SEQ ID NO:64; and SEQ 
ID NO:71; a sequence determination oligonucleotide comprising a sequence selected 
from the group consisting of SEQ ID NO:6; SEQ ID NO: 13; SEQ ID NO:42; SEQ 
ID NO:43; SEQ ID NO:53; SEQ ID NO:60; SEQ ID NO:67; SEQ ID NO:74; and a 
sequence determination oligonucleotide comprising a sequence selected from the 
group consisting of SEQ ID NO:8; SEQ ID NO: 15; SEQ ID NO:46; SEQ ID NO:47; 
SEQ ID NO:55; SEQ ID NO:62; SEQ ID NO:69; and SEQ ID NO:76. The primer 
pairs of this embodiment are preferably selected from the group consisting of SEQ 
ID NO: 35 and SEQ ID NO: 18 (for amplification of the polymorphic region 
corresponding to position 36 of SEQ ID NO:2); SEQ ID NO:27 and SEQ ID NO:28; 
SEQ ID NO:29 and SEQ ID NO:30 (for amplification of the polymorphic region 
corresponding to position 620 of SEQ ID NO:2); SEQ ID NO:19 and SEQ ID 
NO:20; and SEQ ID NO:25 and SEQ ID NO:26 (for amplification of the 
polymorphic region corresponding to position 942 of SEQ ID NO:2.) 

In another specific embodiment, the kit of the invention comprises a pair of 
oligonucleotide primers suitable for amplifying the polymorphic region 
corresponding to position 36 of the CYP2D6 gene 5' flanking region as set forth in 
SEQ ID NO:2, a primer pair suitable for amplifying the polymorphic region 
corresponding to position 620 of the CYP2D6 gene 5' flanking region as set forth in 
SEQ ID NO:2; a primer pair suitable for amplifying the polymorphic region 
corresponding to position 880 of the CYP2D6 gene 5' flanking region as set forth in 
SEQ ID NO:2; a sequence determination oligonucleotide comprising a sequence 
selected from the group consisting of SEQ ID NO:3; SEQ ID NO: 10; SEQ ID 
NO:36; SEQ ID NO:37; SEQ ID NO:50; SEQ ID NO:57; SEQ ID NO:64; and SEQ 
ID NO:71; a sequence determination oligonucleotide comprising a sequence selected 
from the group consisting of SEQ ID NO:6; SEQ ID NO: 13; SEQ ID NO:42; SEQ 
ID NO:43; SEQ ID NO:53; SEQ ID NO:60; SEQ ID NO:67; SEQ ID NO:74; and a 
sequence determination oligonucleotide comprising a sequence selected from the 
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group consisting of SEQ ID NO:7; SEQ ID NO:14; SEQ ID NO:44; SEQ ID NO:45; 
SEQ ID NO:54; SEQ ID NO:61; SEQ ID NO:68; SEQ ID NO:75. The primer pairs 
of this embodiment are preferably selected from the group consisting of SEQ ID 
NO:35 and SEQ ID NO: 18 (for amplification of the polymorphic region 
5 corresponding to position 36 of SEQ ID NO:2); SEQ ID NO:27 and SEQ ID NO:28; 
SEQ ID NO:29 and SEQ ID NO:30 (for amplification of the polymorphic region 
corresponding to position 620 of SEQ ID NO:2); SEQ ID NO: 19 and SEQ ID 
NO:20; and SEQ ID NO:25 and SEQ ID NO:26 (for amplification of the 
polymorphic region corresponding to position 880 of SEQ ID NO:2.) 

10 The kit of the invention may optionally include primer pairs for amplification 

of the polymorphic region corresponding to position 385 of SEQ ID NO:2, such 
primer pairs being selected from the group consisting of SEQ ID NO:29 and SEQ ID 
NO:30, and SEQ ID NO:33 and SEQ ID NO:34. The kit of this embodiment also 
comprises a sequence determination oligonucleotide selected from the group 

15 consisting of SEQ ID NO:5; SEQ ID NO:12; SEQ ID NO:40; SEQ ID NO:41; SEQ 
ID NO:52; SEQ ID NO:59; SEQ ID NO:66; and SEQ ED NO:73. 

The kit of the invention may further optionally include primer pairs for 
amplification of the polymorphic region corresponding to position 1255 of SEQ ID 
NO:2, such primer pairs being selected from the group consisting of SEQ ID NO:21 

20 and SEQ ID NO:22, and SEQ ID NO:23 and SEQ ID NO:24. The kit of this 

embodiment also comprises a sequence determination oligonucleotide selected from 
the group consisting of SEQ ID NO:9; SEQ ID NO:16; SEQ ID NO:48; SEQ ID 
NO:49; SEQ ID NO:56; SEQ ID NO:63; SEQ ID NO:70; and SEQ ID NO:77. 

25 The examples set forth below are provided as illustration and are not 

intended to limit the scope and spirit of the invention as specifically embodied 
therein. 
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EXAMPLE 1 
PHENOTYPES OF STUDY PARTICIPANTS 

The study was performed in accordance with the principles stated in the Declaration 
of Helsinki as reviewed in Tokyo 1975 and Venice 1983, Hong Kong 1989 and 
5 Somerset West 1996. Subjects were preferably not related to each other. Based on 
questioning, individuals having one of the following were excluded: a medical 
condition judged to influence liver function or requiring pharmacological treatment; 
any on-going disease; intake of any drug, except oral contraceptives, during one 
week prior to the study; breast-feeding or pregnancy. No physical examination was 

10 performed. For these experiments, a single oral dose of 10 mg debrisoquine 
(Declinax, Hoffman-LaRoche) was taken in the evening before bed-time. The 
bladder was emptied before drug intake. All urine was then collected overnight 
(about 8 hours) A single blood sample was collected 3 hours after drug intake. 

In the first part of the study, 88 samples (Swedish Caucasians) were selected 

15 as set forth in Table 1, on the basis of the following assumptions: if the distribution 
of an unknown polymorphism will be 25% for a homozygote, a sample size of 
approximately 40 "UEM" will be able to detect an increase in this specific genotype 
(homozygote) by 28% (oc=5% (two-tailed), power=80%). If it is assumed that the 
distribution of an unknown polymorphism will be 10% for a homozygote, a sample 

20 size of approximately 40 "UEM" will be able to detect an increase in this specific 
genotype (homozygote) by 21% (cc=5% (two-tailed), power=80%). 

Individuals with UEM phenotype caused by CYP2D6-gene duplication were 
excluded. Individuals with known defective alleles, Le. CYP2D6*3, CYP2D6*4 and 
CYP2D6*5 were excluded. CYP2D6*6 was also excluded where data was available 

25 (and due to its low allele frequency among Caucasians (1.8%) additional *6 

genotyping was not applied as a standard procedure). However, a few extra samples 

genotyped for any of the alleles mentioned above were included as oudier controls. 

Table 1 



# of samples 


MR 


Phenotvpe 


47 


<0.2 


"UEM" 


26 


0.2-0.8 


"fast EM" 


11 


0.8-12.6 


"slow EM" 


4 


>12.6 


"PM" 



30 

The first part of the study resulted in identification of seven SNPs in the 5' 
flanking region of the CYP2D6 gene. Oligonucleotides containing these SNPs are 
shown in Table 2. 
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Table 2 



Polymorphic 
Site 


Sequence 


Nucleotide change 


36 

(-1496) 


SEQ ID NO:3: GAACCCGGTCT 
SEQ ID NO: 10: GAACCGGGTCT 


C variant 
G variant 


194 

(-1338) 


SEQ ID NO:4: AAAATACAAAAAG 
SEQIDNOrll: AAAATATAAAAAG 


C variant 
T variant 


JOJ 

(-1147) 


crn rr\ Mfi.C A A A A A n A ATT Ann. 

SEQ ID NO: 12: AAAAAGGATTAGG 


A variant 
G variant 


620 
(-912) 


SEQIDNO:6: AGGACGACCCT 
SEQ ID N0:13: AGGACAACCCT 


G variant 
A variant 


880 
(-652) 


SEQ ID NO:7: TGTGCCCTAAG 
SEQ ID NO: 14: TGTGCTCTAAG 


C variant 
T variant 


942 
(-590) 


SEQIDNO:8: TCTGCGTGTGT 
SEQ ID NO:15: TCTGCATGTGT 


G variant 
A variant 


1255 
(-277) 


SEQIDNO:9: TGGCCGGGTCC 
SEQ ID NO: 16: TGGCCAGGTCC 


G variant 
A variant 



In the second part of the study, samples with a more normative phenotypic 
distribution were used. Also, no exclusion of individuals with known defective 
alleles or duplications was done. Table 3 sets forth the phenotypic distribution of 
144 samples used in the second part of this study. 



Table 3 



# of samples 


MR 


Phenotvpe 


15 


<0.2 


"UEM" 


78 


0.2-0.8 


"fast EM" 


33 


0.81-12.6 


"slow EM" 


18 


>12.6 


"PM" 
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EXAMPLE 2 

CYP2D6 GENETIC ANALYSIS 
White blood cells isolated from a blood sample drawn from the brachial vein 

serve as the source of the genomic DNA for the analyses. The DNA was extracted 

by guanidine thiocyanate method or QIAamp Blood Kit (QIAGEN, Venlo, The 

Netherlands). The genes included in the study were amplified by PCR and the DNA 

sequences were determined by the technology most suitable for the specific 

fragment. All genetic analyses were performed according to Good Laboratory 

Practice and Standard Operating Procedures. Case Report Forms were designed and 

used for clinical and genetic data collection. Data was entered and stored in a 

relational database at Gemini Genomics AB, Uppsala. To secure consistency 

between the Case Report Forms and the database, data was checked either by double 

data entry or proofreading. After a Clean File was declared the database was 

protected against changes. By using the program Stat/Transfer™ the database was 

transferred to SAS data sets. The SAS™ system was used for tabulations and 

statistical evaluations. Genotypes and haplotypes were correlated against the 

metabolic ratio. 

PCR-fragments were amplified with TaqGOLD polymerase (Applied 
Biosystems) using Robocycler (Stratagene) or GeneAmp PCR system 9700 
(Applied Biosystems). Preferentially, the amplified fragments were 300-400 bp, and 
the region to be read did not exceed 300 bp for full sequencing and did not exceed 
60 bp for One Base Sequencing (OBS). PCR reactions were carried out according to 
the basic protocol set forth in Table 4, with modifications as indicated in Table 5 for 
specific primer pairs, which are shown in Table 6. For the GeneAmp PCR 9700 
machine the profile used was 10 minutes at 95°, 40 x (45 seconds at 90°, 45 seconds at 
60°, 45 seconds at 72°), 5 minutes at 72° and 22° until removed. 
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Table 4 



Solution 


Stock Concentration 


PCR <jd) 


H 2 0 




33.2 


PCR buffer 


lOx 


5.0 


MgCl 2 


25 mM 


2.0 


dJNTP 


2.5 mM 


2.5 


primer 1 


10\M 


1.0 


primer 2 


10pM 


1.0 


Taq-gold 
polymerase 


5 jo/jjlI 


0.3 


DNA sample 




5.0 


TOTAL 




50.0 



Table 5 



SEQID 
NO:s 


Polymorphic 
Site 


Modification from basic protocol (Table 3) 


SNP Detection method 


35, 18 


36 


64° annealing temperature 


Full sequencing & OBS 


35, 18 


36 


62° annealing temperature 


OBS 


17, 18 


194 


62° annealing temperature 


Full sequencing & OBS 


31,32 


194 


58° annealing temperature, 50 cycles 


Full sequencing 


33, 34 


385 


3 jxl MgCl 2) 58° annealing temperature 


Full sequencing & OBS 


29, 30 


385 & 620 


None 


Full sequencing 


27, 28 


620 


62° annealing temperature, 50 cycles 


Full sequencing & OBS 


19,20 


880 & 942 


62° annealing temperature 


Full sequencing & OBS 


25,26 


880 & 942 


None 


Full sequencing & OBS 


21,22 


1255 


3uJMgCl 2 


Full sequencing 


23,24 


1255 


62° annealing temperature 


Full sequencing & OBS 
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Table 6 


Polymorphic Site 


Primer Pair 


194 


SEQ ID NO: 17; AAATACAAAATTAGCTGGGATTG 
SEQ ID NO:18: GAGACGGAGATTTCCTCTTGT 


880 & 942 


SEQ ID NO: 19: CCTTCCGGCTACCAACTG 
SEQ ID NO:20: TTGCAGGGACACGATTACAC 


1255 


SEQ ED NO:21 : TAAGGGTGCTGAAGGTCACTC 
SEQ ID NO:22: GGGCTGCTCCAGAGGTTC 


1255 


SEQ ID NO:23: CCAGGTAAGTGCCAGTGACA 
SEQ ID NO:24: AGCTCCTGAAGCCTGCAAAG 


880 & 942 


SEQ ID NO:25: GCCAGAGCCCAGGAATGT 
SEQ ID NO:26: GCCTTGCCCTTTCCCTAC 


620 


SEQ ID NO:27: AGAAACATGGAGGCCAGAA 
SEQ ID NO:28: GTTTCCTGGATGGGACCAC 


385 & 620 


SEQ ID NO:29: AGCCTAGAGGTGAAGGTTGTAG 
SEQ ID NO:30 CTTGCCCCAGCCTGTGA 


194 


SEQ ID NO:3 1 : AAAAAATACAAAATTAGCTGGGATT 
SEQ ID NO:32: TTTTTTTTTGG AG AC GG AG AT 


385 


SEQ..ID NO: 33: TTCTTTAGACAGGGTCTCACTCT 
SEQ ID N0.34: GGGCAACAAGAGGAAATCT 


36 


SEQ ID NO:35: GCCTGGACAACTTGGAAGA 
SEQ ID NO: 18: GAGACGGAGATTTCCTCTTGT 



The optimized conditions specified in Table 5 were required to distinguish 
CYP2D6 from the two closely related pseudogenes CYP2D7P and CYP2D8P. Use of 
the basic PCR protocol may lead to problems when amplifying C7P2Z)(5-specific 
amplicons of 300-400 bp containing the polymorphisms of interest, unless a nested 
PCR approach is carried out. The nested PCR approach was not used because of the 
high risk of contamination when using a nested PCR approach and the high risk of 
typing errors as a consequence. The modifications shown in Table 5 were optimized 
and reaction parameters were balanced in such a way that nested PCR was avoided. 

The PCR conditions set forth in Table 5 also relate to the method used to 
detect the polymorphisms in the amplified DNA samples. Since the amplicons of 
300-400 bp containing the polymorphisms of interest are CYP2D6-$pecific, they 
serve as a selection step for the less specific sequence determination 
oligonucleotides (set forth in Tables 2, 7, 8 and 9). This is especially critical for 
typing polymorphisms 880, 942 and 1255, since the CKP2D6-sequence between 
position 771 and 1270 (see SEQ ID NO:2) share 99% identity to CYP2D7P. 

For full sequencing, one of the PCR-primers in a primer pair was designed 
for sequencing by addition of a 29 nucleotide tail complementary to Ml 3 at its 5'- 
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end, namely the nucleotides AGTCACGACGTTGTAAAACGACGGCCAGT. 

Thus, the entire PCR-product was sequenced from the tailed PCR-primer. 
The OBS method as used herein is described in commonly assigned 

international patent application number PCT/GB01/00828. Briefly, the OBS 
5 method is a mini sequencing/primer extension variant, which uses a unique mixture 

of three dNTPs and one ddNTP. A sequencing primer is positioned adjacent or close 

to a polymorphic position, e.g., a SNP. The extension from the sequencing primer 

annealed to a single stranded PCR product continues until a ddNTP is incorporated. 

For example, when detecting an A/C SNP using a ddATP terminator, the extension 
10 will stop at the SNP if an A is present but will continue to the next A in the sequence 

if a C is present. Thus, a heterozygote sample will produce two extension products 

of different defined lengths (see Figure 3). 

The additional oligonucleotides set forth in Tables 7 through 9 were 

identified as being suitable for detection of the SNPs at positions 36, 194, 385, 620, 
15 880, 942, and/or 1255 of the 5' flanking region of the CYP2D6 gene as depicted in 

SEQIDNO:2. 

Table 7 sets forth oligonucleotides representing the non-coding (anti-sense) 
strand complementary to the polymorphic region corresponding to the 
polymorphisms found in the study population. The underlined letter indicates 
20 polymorphic position in the sequence context. Numbers inside brackets are 
calculated from the transcriptional start. All sequences are shown in 5' to 3' 
direction. 
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Table 7 



Polymorphic 
Site 


Sequence 


Note 


36 

(-1496) 


SEQIDNO:36: AGACCGGGTTC 
SEQIDNO:37: AGACCCGGTTC 


Antisense G variant 
Antisense Cvariant j 


194 

(-1338) 


SEQ ID NO:38: CTTTTTGTATTTT 
SEQ ID NO:39: CTTTTTATATTTT 


Antisense G variant 
Antisense A variant 


385 
(-1147) 


SFO TD NO* AO' PPT A A TTPTTTTT 

SEQIDNO:41: CCTAATCCTTTTT 


Antisense T variant 
Antisense C variant 


620 
(-912) 


SEQIDNO:42: AGGGTCGTCCT 
SEQIDNO:43: AGGGTTGTCCT 


Antisense C variant 
Antisense T variant 


880 
(-652) 


SEQIDNO:44: CTTAGGGCACA 
SEQIDNO:45: CTTAGAGCACA 


Antisense G variant 
Antisense A variant 


942 
(-590) 


SEQIDNO:46: ACACACGCAGA 
SEQIDNO:47: ACACATGCAGA 


Antisense C variant 
Antisense T variant 


1255 
(-277) 


SEQIDNO:48: GGACCCGGCCA 
SEQIDNO:49: GGACCTGGCCA 


Antisense C variant 
Antisense T variant 



The sequences of Table 8 represent the 5 '-sequence to the polymorphic sites 
on the coding (sense) strand (SEQ ID NO:s 50-56) and non-coding (anti-sense) 
strand (SEQ ID NO:s 57-63). The underlined letter indicates polymorphic position 
in the sequence context. Numbers inside brackets are calculated from the 
transcriptional start. All sequences are shown in 5' to 3' direction. 



Table 8 



Polymorphic 
Site 


Sequence 


Note 


36 

(-1496) 


SEQIDNO:50: ACTTGGAAGAA 
SEQ ID NO:57: TTCTTCCAAGT 


Sense 5' 
Antisense 5' 


194 

(-1338) 


SEQ ID NO:5 1 : TCTACTGAAAA 
SEQEDNO:58: TTTTCAGTAGA 


Sense 5' 
Antisense 5' 


385 

(-1147) 


SEQ ID NO:52: CCAAAAAAAAAAAAAAAAAAAAAAG 
SEQIDNO:59: CTTTTTTTTTTTTTTTTTTTTTTGG 


Sense 5' 
Antisense 5' 


620 
(-912) 


SEQ ID NO:53: AGTGGAGGAGG 
SEQ ID NO:60: CCTCCTCCACT 


Sense 5* 
Antisense 5' 


880 
(-652) 


SEQIDNO:54: AGAGAATGTGT 
SEQ ID NO:61: ACACATTCTCT 


Sense 5' 
Antisense 5* 


942 
(-590) 


SEQ ID NO:55: GGTGATTTTCT 
SEQIDNO:62: AGAAAATCACC 


Sense 5' 
Antisense 5' 


1255 
(-277) 


SEQ ID NO:56: GAGGTGGATGG 

SEQ ID NO:63: CCATCCACCTC 


Sense 5* 
Antisense 5* 



The sequences of Table 9 represent the 3'-sequence to the polymorphic sites 
on the non-coding (anti-sense) strand (SEQ ID NO:s 64-70) and the coding (sense) 
strand (SEQ ID NO:s 71-77). Underlined letter indicates polymorphic position in the 
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sequence context. Numbers inside brackets are calculated from the transcriptional 
start. All sequences are shown in 5' to 3' direction. 



Table 9 



PfYI vmnrnh i c 

Site 


Sequence 


Note 


36 

(-1496) 


SEQIDNO:64: TTTTGTAGAGA 
SEQIDNO:71: TCTCTACAAAA 


Antisense 3' 
Sense 3' 


194 

(-1338) 


SEQ ID NO:65: CGTCTAGCTTT 
SEQIDNO:72: AAAGCTAGACG 


Antisense 3' 
Sense 3' 


385 
(-1147) 


SEQIDNO:66: CACCCAGCCTA 
SEQ ID NO:73: TAGGCTGGGTG 


Antisense 3* 
Sense 3' 


620 
(-912) 


SEQ ID NO:67: GCTGCCTGAGG 
SEQIDNO:74: CCTCAGGCAGC 


Antisense 3' 
Sense 3' 


880 
(-652) 


SEQ ID NO:68: CACTGACACTT 
SEQ ID NO:75: AAGTGTCAGTG 


Antisense 3' 
Sense 3' 


942 
(-590) 


SEQ ID NO:69: ACACGATTACA 
SEQ ID NO:76: TGTAATCGTGT 


Antisense 3' 
Sense 3' 


1255 
(-277) 


SEQ ID NO:70: GITTCAGTGGA 
SEQIDNO:77: TCCACTGAAAC 


Antisense 3' 
Sense 3' 



EXAMPLE 3 
HAPLOTYPE AND GENOTYPE ANALYSES 

Haplotype analysis could be performed on a total of 232 individuals. This 
analysis was performed using software based on maximum likelihood methodology 
and using the EM algorithm of Excoffier et al. (1995), Mol Biol Evol. 12:921-927. 
In total 5 likely haplotypes were identified by the program. One of these occurred 
only six times in the study population and has been excluded from the study due to 
its low frequency. The characterization of each haplotype is presented in Table 10, 
and the frequency of each haplotype is set forth in Table 1 1 . From the haplotype 
information two different kinds of variables were created: one variable was formed 
as a haplotype combination variable (HTYPE). This variable has the value H1/H2 
when the subject has haplotypes 1 and 2, etc. Variables HI, H2, H3 and H4 are 
haplotype annotations that denote the number of copies of that particular haplotype 
for the subject, e.g., for a subject with haplotype H1/H2 the variables HI, H2, H3 
and H4 will be 1, 1, 0 and 0, respectively. Each of these variables can thus take on 
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the values 0, 1 or 2. Only the four most frequent haplotypes were considered when 
those variables were formed. 



Table 10 



Haplotype 


Nucleotide at polymorphic position: 


36 


194 

1: 


385 
A 


620 
G 


880 
C 


942 
G 


1255 
G 


M33388 

(GcnBank) 

SEQIDNOrl 


C " 


HI 

(CCAGCGG) 


C 


c 


A 


G 


c 


G 


G 


H2 

(GCGGTAG) 


G 


c 


G 


G 


T 


A 


G 


H3* 

(CCGGTAG) 


C 


c 


G 


G 


T 


A 


G 


H4* 

(CTGACGG) 


c 


T 


G 


A 


C 


G 


G 



Table 11 





Haplotype 


Haplotype 


P-value (Sp) 


Note 










frequency 












HI 


46% 


0.0001 


HI/HI 


n=53 


mr50=0.22 










Hl/- 


n=108 


mr5Q=0.375 










-/- 


n=71 


mr50=0.87 




H2 


27% 


0.0001 


H2/H2 


n=19 


mr50=0.3 










H2/- 


n=88 


mr50=0.335 










-/- 


n=125 


mr50=0.56 


r 


H3 


8% 


0.0012 


H3/H3 


n=3 


mr50=2.08 










H3/- 


n=31 


mr50=0.64 










-/- 


n=198 


mr50=0.34 


CI 


H4 


17% 


0.0001 


H4/H4 


n=16 


mr50=88 










H4/~ 


n=49 


mr50=0.86 










-/- 


n=167 


mr50=0.27 
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Table 1 1 also sets forth the statistical p-values (Spearman correlation) 
between CYP2D6 haplotypes H1-H4 and mr(debrisoquine), where mr50 is an 
abbreviation for metabolic ratio of the 50 th percentile. 

Table 12 sets forth a summary of the predictive haplotypes found in the study 
15 described in Examples 1 and 2. 



Table 12 



Haplotype 


Metabolic capacity 


Note 


HI 


UEM&EM 


H1/H2 is faster (UEM/EM) 


H2 


UEM&EM 


H1/H2 is faster (UEM/EM) 


H3 


IM 




H4 


PM 


In 99% LD with CYP2D6*4 
(80 samples/81 samples) 



25 



Table 13 shows CYP2D6 genotype markers for haplotype combinations and 
their predicted metabolic ratios based on 232 samples. It should be noted that the 
method of the invention may use detection of only three SNPs in the 5' flanking 
region of the CYP2D6 gene, since position 2D6:194 can be replaced with position 
2D6:620, and position 2D6:942 with position 2D6:880 with the same resolution 
power as shown in Table 13, 



Table 13 



| CYP2D6 genotype 


HTYPE 


Marker for 


MR (Debr) 


% of haplotypes 
in MR-range 


MR-range 
(min-max) 


2D6:36 


2D6:194 


2D6:942 


C/G 


C 


A/G 


H1/H2 


UEM/EM 


<0.4 


81% (52/64) 


0.06-1.04 


C 


C 


G 


HI/HI 


UEM&EM 


<0.8 


89% (49/55) 


0.03-110 


G 


c 


A 


H2/H2 


UEM&EM 


<0.8 


89% (17/19) 


0.13-1.44 


C/G 


C 


A 


H2/H3 


EM & IM 


0.2-7.0 


86% (6/7) 


0.13-2.29 


C 


C 


A/G 


H1/H3 


EM 


0.4-2.0 


74% (14/19) 


0.08-2.40 


C/G 


C/T 


A/G 


H2/H4 


EM 


0.4-2.0 


82% (14/17) 


0.33-3.70 


C 


C/T 


G 


H1/H4 


EM & IM 


0.4-7.0 


74% (20/27) 


0.18-143 


C 


C 


A 


H3/H3 


IM 


0.8-7.0 


100% (3/3) 


0.85-5.17 


C 


C/T 


A/G 


H3/H4 


IM 


0.8-7.0 


100% (5/5) 


1.46-6.54 


c 


T 


G 


H4/H4 


PM 


>12.6 


100% (16/16) 


26.3-236 



While the invention has been described in terms of the specific embodiments 
set forth above, those of skill will recognize that the essential features of the 
invention may be varied without undue experimentation and that such variations are 
within the scope of the appended claims. 



26 



